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Abstract — Recently, network error correction coding (NEC) 
has been studied extensively. Several bounds in classical coding 
theory have been extended to network error correction coding, 
especially the Singleton bound. In this paper, following the 
research line using the extended global encoding kernels proposed 
in fl2l . the refined Singleton bound of NEC can be proved 
more explicitly. Moreover, we give a constructive proof of the 
attainability of this bound and indicate that the required field 
size for the existence of network maximum distance separable 
(MDS) codes can become smaller further. By this proof, an 
algorithm is proposed to construct general linear network error 
correction codes including the linear network error correction 
MDS codes. Finally, we study the error correction capability of 
random linear network error correction coding. Motivated partly 
by the performance analysis of random linear network coding 
1 6 1, we evaluate the different failure probabilities defined in this 
paper in order to analyze the performance of random linear 
network error correction coding. Several upper bounds on these 
probabilities are obtained and they show that these probabilities 
will approach to zero as the size of the base field goes to infinity. 
Using these upper bounds, we slightly improve on the probability 
mass function of the minimum distance of random linear network 
error correction codes in |7|, as well as the upper bound on 
the field size required for the existence of linear network error 
correction codes with degradation at most d. 

Index Terms — Network coding, network error correction cod- 
ing, the refined Singleton bound, maximum distance separable 
(MDS) code, random linear network error correction coding, the 
extended global encoding kernels, network error correction code 
construction. 



I. Introduction 

NETWORK coding was first introduced by Yeung and 
Zhang in HI and then was profoundly developed by 
Ahlswede et al. @. In the latter paper [2], the authors showed 
that by network coding in network communication, the source 
node can multicast the information to all sink nodes at the 
theoretically maximum rate as the alphabet size approaches 
infinity, where the theoretically maximum rate is the smallest 
minimum cut capacity between the source node and any 
sink node. Li et al. J3) indicated that linear network coding 
with finite alphabet size is sufficient for multicast. In fl4), 
Koetter and Medard presented an algebraic characterization 
for network coding. Although network coding can achieve the 
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higher information rate than classical routing, Jaggi et al. 
still proposed a deterministic polynomial-time algorithm for 
constructing a linear network code. Random linear network 
coding was originally introduced by Ho et al. (6|, and the 
authors analyzed the performance of random linear network 
coding by studying the failure probabilities of the codes. Balli, 
Yan, and Zhang [7| improved on the upper bounds on these 
failure probabilities and then studied the asymptotic behavior 
of the failure probability as the field size goes to infinity. 
Following [7 1, Guang and Fu [8] gave some tight or asymp- 
totically tight bounds for two kinds of failure probabilities and 
also gave the specific network structures in the worst cases. 

Network coding has been extensively studied for several 
years under the assumption that channels of networks are error- 
free. Unfortunately, all kinds of errors may occur in network 
communication such as random errors, erasure errors (packet 
losses), errors in headers and so on. In order to deal with such 
problems, network error correction coding (NEC) was studied 
recently. The original idea of network error correction coding 
was proposed by Yeung and Cai in their conference paper 
||9l and developed in their recent journal papers iflOl ifTTI . In 
the latter two papers, the concept of network error correction 
codes was introduced as a generalization of the classical error 
correction codes. They also extended some important bounds 
from classical error correction codes to network error correc- 
tion codes, such as the Singleton bound, the Hamming bound, 
and the Gilbert- Varshamov bound. Although the Singleton 
bound has been given in Cai and Yeung IflOl . Zhang lfl2ll 
and Yang et al. Ifl3ll (14| presented the refined Singleton 
bound independently by using the different methods. Yang et 
al. lfT31 lfT4l developed a framework for characterizing error 
correction/detection capabilities of network error correction 
codes. They defined different minimum distances to measure 
error correction and error detection capabilities, respectively. 
It followed an interesting discovery that, for nonlinear network 
error correction codes, the number of the correctable errors can 
be more than half of the number of the detectable errors. In 
fT2l . Zhang defined the minimum distance of linear network 
error correction codes and introduced the concept of extended 
global encoding kernels. Using this concept, Zhang proposed 
linear network error correction codes in packet networks. 
Besides coherent networks, this scheme is also suitable to non- 
coherent networks by recording the extended global encoding 
kernels in the headers of the packets. Moreover, the extended 
global encoding kernels are used to form the decoding matrices 
at sink nodes. As well as in fl6l , the decoding principles and 
decoding beyond the error correction capability were studied. 
The authors further presented several decoding algorithms and 
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analyzed their performance. In addition, Balli, Yan, and Zhang 
171 studied the error correction capability of random linear 
network error correction codes. They gave the probability mass 
function of the minimum distance of random linear network 
error correction codes. For the existence of a network error 
correction code with degradation, the upper bound on the 
required field size was proposed. 

In [17 1, Koetter and Kschischang formulated a different 
framework for network error correction coding. In their ap- 
proach, the source message is represented by a subspace of a 
fixed vector space and a basis of the subspace is injected into 
the network. This type of network error correction codes is 
called subspace codes. 

In this paper, we follow the research line using the extended 
global encoding kernels introduced by Zhang in [18] [12|. 
We reprove the refined Singleton bound of the network error 
correction codes more explicitly by using the concept of the 
extended global encoding kernels. Similar to the Singleton 
bound in classical coding theory, the refined Singleton bound 
is also tight and those linear network error correction codes 
achieving this bound with equality are called linear network 
error correction maximum distance separable (MDS) codes, 
or network MDS codes for short. For network MDS codes, 
Zhang 1 12] gave an existence proof by an algebraic method. In 
this paper, we present a constructive proof of the attainability 
of the refined Singleton bound, and indicate that the required 
field size for the existence of network MDS codes can be- 
come smaller (in some cases much smaller) than the known 
results. Moreover, by this proof, we design an algorithm for 
constructing general linear network error correction codes, in 
particular, network MDS codes. 

Matsumoto [19| and Yang et al. lTT~3l also proposed the al- 
gorithms for constructing network MDS codes. The algorithm 
of Yang et al. designs the codebook and the local encoding 
kernels separately. On the contrary, Matsumoto's algorithm 
and our algorithm design them together. As noted above, 
the required field size of our algorithm is smaller. Moreover, 
compared with Matsumoto's algorithm, our algorithm needs 
less storages at each sink node. For the decoding, as mentioned 
in |[T9ll , the decoding of Matsumoto's algorithm requires ex- 
haustive search by each sink node for all possible information 
from the source and all possible errors, and our algorithm 
can make use of the better and faster decoding algorithms 
proposed by Zhang, Yan, and Balli in a series of papers |18|, 
fl2l . and |16| such as the brute force decoding algorithm and 
the fast decoding algorithm. For the case of decoding network 
error correction codes beyond the error correction capability 
in packet networks IIT61 . our algorithm has more advantages 
because of the use of extended global encoding kernels. 
We further study the error correction capability of random 
linear network error correction coding, and analyze the failure 
probabilities of constructing network MDS codes and general 
network error correction codes by using random method, as 
well as the probability mass function of the minimum distance 
and the required field size. 

This paper is divided into 6 sections. In the next section, we 
introduce the basic notation and definitions in linear network 
coding and linear network error correction coding, and give 



some propositions needed in this paper. In Section III, we 
reprove the refine Singleton bound by using the concept of the 
extended global encoding kernels, and propose a constructive 
proof to show the attainability of the refined Singleton bound 
of NEC. Consequently, we also indicate that the required field 
size for the existence of network MDS codes can become 
smaller than the known results. Section IV is devoted to 
the algorithm for constructing general linear network error 
correction codes, including network MDS codes. In Section V, 
we analyze the performance of random linear network error 
correction codes . The last section summarizes the works done 
in this paper. 

II. Basic Notation and Definitions 

In this paper, we follow lfl2l in its notation and terminol- 
ogy. A communication network is defined as a finite acyclic 
directed graph G — (V,E), where the vertex set V stands 
for the set of nodes and the edge set E represents the set 
of communication channels of the network. The node set V 
consists of three disjoint subsets S, T, and J, where S is 
the set of source nodes, T is the set of sink nodes, and J = 
V — S — T is the set of internal nodes. Furthermore, a direct 
edge e = (i, j) E E represents a channel leading from node i 
to node j. Node i is called the tail of e and node j is called the 
head of e, written as i — tail(e), j ~ head(e), respectively. 
Correspondingly, the channel e is called an outgoing channel 
of i and an incoming channel of j. For a node i, define 
Out(i) — {e E E : e is an outgoing channel of i}, In(i) — 
{e E E : e is an incoming channel of i}. In a communi- 
cation network, if a sequence of channels (e\,e2,"- ,e m ) 
satisfies tail(ei) = i, head(e rn ) = j, and tail(ek+i) = 
head(ek) for k = 1, 2, • • • , m — 1, then we call the sequence 
(ei, e2, • • • , e TO ) a path from node i to node j, or equivalently, 
a path from channel e\ to node j. For each channel e E E, 
there exists a positive number R e called the capacity of e. We 
allow the multiple channels between two nodes and assume 
reasonably that the capacity of any channel is 1 per unit time. 
This means that one field symbol can be transmitted over a 
channel in one unit time. A cut between node i and node j 
is a set of channels whose removal disconnects i from j. For 
unit capacity channels, the capacity of a cut can be regarded 
as the number of channels in the cut, and the minimum of all 
capacities of cuts between i and j is called the minimum cut 
capacity between node i and node j. A cut between node i 
and node j is called a minimum cut if its capacity achieves 
the minimum cut capacity between i and j. Note that there 
may exist several minimum cuts between i and j, but the 
minimum cut capacity between them is determined. The source 
nodes generate messages and transmit them to all sink nodes 
over the network by network coding. In the present paper, 
we consider single source networks, i.e., |5| = 1, and the 
unique source node is denoted by s. The source node s has 
no incoming channels and any sink node has no outgoing 
channels, but we use the concept of imaginary incoming 
channels of the source node s and assume that these imaginary 
incoming channels provide the source messages to s. Let the 
information rate be w symbols per unit time. Then the source 
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node has w imaginary incoming channels d[, d' 2 , ■ ■ ■ ,d' w and 
let In(s) — {d[,d' 2 ,--- ,d' w }. The source messages are w 
symbols X = (Xi, X2, ■ ■ ■ ,X W ) arranged in a row vector 
where each Xi is an element of base field T . Assume that they 
are transmitted to the source node s through the w imaginary 
channels in In(s). By using network coding, source messages 
are multicast to and decoded at each sink node. 

At each node i E V—T, there is an x \Out(i)\ matrix 

Ki = (kd, e )d£in(i),eeOut{{) called the local encoding kernel at 
i, where kd. e E F is called the local encoding coefficient for 
the adjacent pair of channels (d, e). Denote by U e the message 
transmitted over the channel e. At the source node s, assume 
that the message transmitted over the ith imaginary channel is 
the ith source message, i.e., U~d> = Xi. In general, the message 
U e is calculated by the formula U e = 2~2dein(tail(e)) k d, e U d - 
As we know from [20 1 [21], the global encoding kernel of a 
channel e is a w-dimensional column vector f e over the base 
field F satisfying U e — X • f e . The global encoding kernels 
can be determined by the local encoding kernels. 

In the case that there is an error in a channel e, the output 
of the channel is U e — U e + Z e , where U e is the message 
that should be transmitted over the channel e and Z e E F 
is the error occurred in e. We treat Z e as a message called 
error message. To explain the approach, the extended network 
was introduced in [12] as follows. In the original network 
G = (V, E), for each channel e E E, an imaginary channel e' 
is introduced, which is connected to the tail of e to provide 
error message. This network G = (V, E) with imaginary 
channels is called the extended network, where V = V and 
E = EU E' U {d[,d' 2 ,-- ,d' w } with E' = {e' : e E E}. 
Obviously, \E'\ = \E\. Then a linear network code for the 
original network can be extended to a linear network code 
for the extended network by letting k e >, e = 1 and k e ',d = 
for all d € E\{e}. For each internal node i in the extended 
network, note that In(i) only includes the real incoming 
channels of i, that is, the imaginary channels e' corresponding 
to e £ Out(i) are not in In(i). But for the source node 
s, we still define In(s) — {d'^d^,--- ,d' w }. In order to 
distinguish two different types of imaginary channels, we call 
d'i (1 < i < w) the imaginary message channels and e' 
for e g E the imaginary error channels. We can also define 
global encoding kernel f e for each e E E in the extended 
network. It is a (w + \E\) -dimensional column vector and the 
entries can be indexed by the elements of In(s) U E. For 
imaginary message channels d[ (1 < i < w) and imaginary 
error channels e' E E', let fd>. = Id'., fe' = le, where Id is 
a (w + I E |) -dimensional column vector which is the indicator 
function of d E In(s) U E. For other global encoding kernels 
f e , e E E, we have recursive formulae: 

fe = ^2 k d.efd + le- 

d£l n{tail{e)) 

We call fe the extended global encoding kernel of the channel 
e (e E E) for the original network. Furthermore, similar to 
the Koetter-Medard Formula [4], there also exists a formula 
El: 

(fe: eEE)=( * )(I-F)-\ 



where B = {kd, e )deln(s),eeE is a wx \E\ matrix with k d , e = 
for e ^ Out(s) and kd >e being the local encoding coefficient 
for e E Out(s), the system transfer matrix F = (kd, e )d£E,e£E 
is an \E\ x matrix with kd, e being the local encoding 
coefficient for head(d) = tail(e) and kd, e — for head(d) ^ 
tail(e), and / is an \E\ x \E\ identity matrix. 

Let Z = (Z e : e E E) be an \E\ -dimensional row vector 
with Z e E T for all e E E. Then Z is called the error message 
vector. An error pattern p is regarded as a set of channels in 
which errors occur. We call that an error message vector Z 
matches an error pattern p, if Z e = for all e E E\p. 

For a channel e € E, if there is no error in it, then 

Ue = (X,Z) • fe = (X,Z) ■ (fe - l e ) - U e . 

If there is an error Z e 7^ in channel e, then 

Ue = U e + Z e = (X, Z) • ( f e ~ l e ) + Z e 

= (X,Z) • (fe - le) + (X, Z) • l e = (X, Z) • f e . 

At a sink node t, the messages {U e : e E In(t)} and the 
extended global encoding kernels {/ e : e E In(t)} are avail- 
able. For all messages including information messages and 
error messages, if they are considered as column vectors, then 
the above discussions describe linear network error correction 
coding in packet networks. 

First, we need some notation and definitions which either 
are quoted directly or are extended from Zhang lfl2ll . 

Definition 1 ( H2\ Definition 1]): The matrix 

F t = (f e : eEln(t)) 
is called the decoding matrix at a sink node t E T. Let 

A t = (U e : eE In(t)). 

The equation 

(X,Z)F t = i t 

is called the decoding equation at a sink node t. 

Definition 2: For an error pattern p and extended global 
encoding kernels / e , e E E, 

• f§ is a (w + I p\) -dimensional column vector obtained 
from f e = (fe(d) : d E In(s) U E) by removing all 
entries f e (d), d ^ In(s)Up, and fg is called the extended 
global encoding kernel of channel e restricted to the error 
pattern p. 

• f§ is a (w + I E |) -dimensional column vector obtained 
from f e = (fe(d) : d E In(s) U E) by replacing all 
entries f e (d), d ^ In(s)Up by 0, and fg is also called the 
extended global encoding kernel of channel e restricted 
to the error pattern p. 

• fg is a (w + I E |) -dimensional column vector obtained 
from f e = (f e (d) ■ d E In(s) U E) by replacing all 
entries f e (d), d E In(s) U p by 0. 

Note that fg + fg c = f e . 

Definition 3 ( H12\ Defintion 3]): Define 

A(i, p) = {(0, Z)Ft '■ all Z matching the error pattern p} 

where is a w-dimensional zero row vector, and Z is an 
dimensional row vector matching the error pattern p; and 

= {(X,0)F f : XeF}. 
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We call A(t, p) the error space of error pattern p and <£>(t) the 
message space. 

Let L be a collection of vectors in a linear space. (L) 
represents the subspace spanned by the vectors in L. In fact, if 
we use rowt(d), d £ In(s) U E to denote the row vectors of 
the decoding matrix F t , then A(t,p) — ({rowt(d) : d £ p}) 
and $(£) = ({row t {d) : d G In(s)}). 

Definition 4 ( H2\ Definition 4]): We say that an error pat- 
tern pi is dominated by another error pattern p 2 with respect 
to a sink node t if A(t, pi) C A(t, p 2 ) for any linear network 
code. This relation is denoted by pi -< t p 2 . 

Definition 5 ( H2\ Definition 5]): The rank of an error pat- 
tern p with respect to a sink node t is defined by 

rankt(p) = min{|p'| : p -< t p'}. 

In order to understand the concept of rank of an error pattern 
better, we give the following proposition. This proposition is 
a slight and necessary modification of [12, Lemma 1]. 

Proposition 1: For an error pattern p, introduce a source 
node s p . Let p = {ei,e2, ••• , e/} where ej <E In(ij) for 
1 < j < I an d define e'j — (s p ,ij). Replace each by e'j 
on the network, that is, add e'i, e 2 , ■ ■ ■ , e[ on the network and 
delete ei, e2, • • • , e/ from the network. Then the rank of the 
error pattern p with respect to a sink node t is equal to the 
minimum cut capacity between s p and t. 

Proof: It is similar to the proof in fl2l . and, therefore, 
omitted. ■ 

Definition 6 ( KT2\ Definition 6]): A linear network error 
correction code is called a regular code if for any t E T, 
dim($(t)) = w. 

Definition 7 ( KT2\ Definition 7]): The minimum distance 
of a regular network error correction code at a sink node t 
is defined by 



d min = mm{rank t (p) : dim(A(*, p) n > 0}. 



For the minimum distance above, we give the following 
proposition. 

Proposition 2: For the minimum distance of a regular net- 
work error correction code at a sink node t, there exist the 
following equalities: 

d<£ in = min{ranh(p): A(t, p) n $(t) ji {0}} (1) 
= min{|p|: A(i, p) n *(t) ^ {0}} (2) 
= min{dim(A(t, p)) : A(i, p) n $(t) ^ {0}}. (3) 

Proof: We define the set of error patterns II = {p : 
A(t,p) n ^ {0}}. Then one has 

(fT| = min rankt (p) , = min|p|, (J3j = min dim(A(t, p)). 
pen pen pen 

Since dim(A(t, p)) < rank t (p) < \p\ for any error pattern 
p C J5, it follows that 

mindim(A(i, p)) < min rankt (p) < min|p|. 

pen pen pen 

In view of the inequalities above, it is enough to prove 
mm pen \p\ < mm pen dim(A(t, p)). Let p' £ II be an error 
pattern satisfying 

dim(A(i,p')) = min dim(A(t, p)). 
pen 



Assume that p' = {ei,e2,--- , e;}, which means A(i, p') = 
({row t (ej) : 1 < i < I}). For {row t (ej) : 1 < i < I}, let 
its maximum independent vector set be {row t (e, . ) : 1 < j < 
m}, where m — dim(A(t, p')) < /. Set pi = {e^. : 1 < j < 
m}. This implies that 

\ Pl | = dim(A(t, pO) = dim(A(t, p')) 



and 



A(t,p 1 )n$(t) = A(f,p')n$(t)^{0}. 



Therefore, 

min |p| < |pi| = dim(A(i,p')) = mindim(A(£, p)). 
pen pen 

The proof is completed. ■ 
In this paper, we always use w to denote the information 
rate and Ct to denote the minimum cut capacity between the 
unique source node s and sink node i, and define St = Ct — w 
which is called the redundancy of sink node t. 

III. The Refined Singleton Bound of NEC and The 
Network MDS Codes 

By using the concept of the extended global encoding 
kernels, we can reprove the refined Singleton bound of NEC. 
First, we give the following lemma. 

Lemma 1: For a regular linear network error correction 
code, let a channel set {ei,e2,- - - J e C t } be a minimum 
cut between s and t with an upstream-to-downstream order 
ei -< e2 -< • • • -< ec't an d let an error pattern p = 
{e w ,e w+ i, • • • , e c J. Then $(t) n A(i, p) ^ {0}. 

Proof: Let X and Z represent the source message vector 
and the error message vector, respectively. Then, for each 
channel e e E, we have U e = (X, Z) ■ / e , where U e is 
the output of e. Let f/ ei = £/ e2 = • • • = V &w _ l = 0. Since 
Rank((/ ei / e . 2 • • • / e „_i)) is at most (w — 1), there exists 
a nonzero message vector X x and an error message vector 
Zli= 0. such that 

(XijZj) ■ (/ ei / e2 ■■• /e„_i) 
= (X 1 ,0)-(/ ei / e2 ••• / eTO _J 

=(C7 ei L> e2 ••• C7 e)o _ 1 ) = 0. 
Moreover, as this code is regular, this implies 

(Xi.fi) ■ (f ei fe 2 ■■■ fe Ct ) = (U ei U e2 ■■■ U eCt ) ^ 0. 

Assume the contrary, i.e., (U ei U e2 ■ ■ ■ U eo ) = 0. And note 
that {ei, e2, • • • , ec t } is a minimum cut between s and i and 
Z_! = 0. It follows that 

A t = (U e :ee In{t)) = 0, 

which implies that (X l5 0)F t = from the decoding equation 
(X_i>Z_i)^7t _ ^ t Therefore, the equality Xi = follows 
from dim($(i)) = w because the linear network error 
correction code considered is regular. This contradicts our 
assumption X : ^ 0. 

On the other hand, there exists another source message 
vector X 2 = fi an d another error message vector Z 2 matching 
the error pattern p = {e^,, e w +%, ■ ■ ■ , ec t }, such that 



(X2> ZL2) ' (/ei fe 



f ect ) = (u ei u e , 



Ue Ct ). 



CONSTRUCTION OF NETWORK ERROR CORRECTION CODES IN PACKET NETWORKS 



5 



And note that Z 2 ^ because (U ei U e2 ■ ■ ■ U ec ) 7^ 0. In 
fact, since e w -< e w +\ -<•••-< ec t , for e £ p, we can set 
sequentially: 

Z e = ?7 e - k die U' d , 

where U' d is the output of channel d in this case. 
Therefore, it follows that 

(Xi,0)-F t = (0,Z 2 )-F t . 

And note that Z 2 matches the error pattern p. It is shown that 
n A(f, p) 7^ {0}. The lemma is proved. ■ 
Theorem 2 (The Refined Singleton Bound): Let d~L be 
the minimum distance of a regular linear network error cor- 
rection code at a sink node t G T. Then 



Remark 1: Conventionally, if a regular network error cor- 
rection code C satisfies the refined Singleton bound with 
equality, that is, d^ in = St + 1 for each t G T, then this 
code C is called network error correction maximum distance 
separable (MDS) code, or network MDS codes for short. 

It is not hard to see that Theorem [2] is an obvious con- 
sequence of Proposition [2] and Lemma Q] Now, we give a 
constructive proof to show that the refined Singleton bound 
is tight. First, we need the following lemma from |12|. 
Define Rt(St) as the set of the error patterns p satisfying 
\p\ = rankt(p) — St, that is, 

Rt(St) — {error pattern p : \p\ = rank t (p) — St}- 

Lemma 3: For each t G T and any error pattern p E R t (St), 
there exist (w + 5 t ) channel disjoint paths from either s or p 
to t, and the (w + 5 t ) paths satisfy the properties that 

1) there are exactly St paths from p to t, and w paths from 
s to i; 

2) these St paths from p to t start with the different channels 
in p. 

Furthermore, in Lemma [3] assign w imaginary message 
channels d'i, d' 2 , ■ ■ ■ , d' w to the w paths from s to t, and assign 
St imaginary error channels e' ,e E p to the <5 4 paths from p to 
i, i.e., for each e E p, assign e' to the path from e to t. This 
leads to the following corollary. 

Corollary 4: For each t E T and any error pattern p E 
Rt(St), there exist (w + St) channel disjoint paths from either 
In(s) = {4,4: • • • , d' w } or p' — {e 1 : e E p} to t, and the 
(w + S t ) paths satisfy the properties that 

1 ) there are exactly St paths from p' to t, and w paths from 
In(s) to t; 

2) these St paths from p' to t start with the distinct channels 
in p 1 and for each path, if it starts with e' E p', then it 
passes through e E p. 

Theorem 5: If |.F| > 2~2t£T \Rt(St)\, then there exist linear 
network error correction MDS codes, i.e., for all t E T, 



Proof: Let G — {V, E} be a single source multicast 
network, where s is the single source, T is the set of sink 



nodes, J = V — {s} — T is the set of internal nodes, and E 
represents the set of channels in G. Let G — (V, E) be the 
extended network of G. For each t E T and each p E Rt{5t), 
Vt, P denotes the set of (w+S t ) channel disjoint paths satisfying 
Corollary |4] Denote by E t . p the set of all channels on paths 
in V t ,p- 

Now, we define a dynamic set of channels CUT tyP for each 
t E T and each p E Rt(St), and initialize 

GUTt. p - In(s) Up' = ■•• ,d'J U {e' : e E p}, 

where e' is the imaginary error channel corresponding to e. 
Initialize fd=Q. for all d E E and fd = Id for all d E 
In(s)UE'. Naturally, we are interested in {fd : d E CUT t , p }. 
For any subset B C In{s) U E' U E, define 

C(B) = ({/„ : e E B}),£"(B) = ({# : e E B}), and 
/y>(B) = ({/£ : e e B», = ({/f : e e £?}). 

For CUT t , p , note that the initial set is CUT t:P = In(s)Up', 
which means 

C(CUT tiP ) = ({f d : d E In(s) U p'}) 

= ({l d :dEln(s)U{e':eEp}}). 

Thus (f p d :dE CUT tiP ) = (f d :dE In(s) U p') is an 
identity matrix of size (w + S t ) x (w + S t ). That is, 
Rank((/ d p : d E In(s) U p') ) = w + S t or dim(£"(Cf7Ti i/5 )) 
= u; + S t . 

Next, we will update CUT t ^ p in the topological order of all 
nodes until CUT tiP C i"n(i). 

For each i E V, consider all channels e E Out(i) in arbi- 
trary order. For each e € Out(i), if e ^ Uter U pe ^ t (5 t ) _E tiP , 
let / e = l e , and all GUT t . p remain unchanged. Otherwise 
e E Uter ^peRt{&t) E t, P > i- e - e G S t)P for some i G T and 
/? G Rt{St)- In T^t^, we use e(t,p) to denote the previous 
channel of e on the path which e locates on. Choose 

g e E C(In(i) U {e'})\ (4) 
U KT LW t(Jt) : [£ p (C7(7T t . p \{e(i,p)}) + £ pc (/n(i) U {e'})], 

where the addition "+" represents the sum of two vector 
spaces. Further, let 

7 _ f ,9e + le if <?e(e) = 0, 

1 g e (e) _1 ■ g e otherwise. 

For those CUT ttP satisfying e G E t , p , update CUT t . p = 
{CUT t , p \{e(t, p)}} U {e}; and for others, CUT t , p remain 
unchanged. 

Updating all channels in E by the same method, one can 
see that all f e ,e E E are well-defined and, finally, CUT t . p C 
In(t) for all f G T and p € R t (S t ). 

To complete the proof, we only need to prove the following 
two conclusions: 

1) For each t E T, d® ia = S t + l. 

2) There exists nonzero column vector g e satisfying (0J. 
The proof of 1): We will indicate that all CUT t , p satisfy 

d\\j\(C p (CUTt :P )) — w+5t during the whole updating process 
by induction. 
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Assume that all channels before e have been updated and 
dim(£ p (CUT t , p )) = w + S t for each CUT t . p . Now, we take 
the channel e into account. Since we choose 

g e e £(In(i) U {e'})\ 

U teT U„ 6W , [£P(CUT ttP \{e(t,p)}) + £P°{In{i) U {e'})], 

it follows that gP and {f p d : d G CUT t , p \{e(t, p)}} are lin- 
early independent for any CUT ttP with e € E t:P . Conversely, 
suppose that g P and : d G CUT t , p \{e(t, p)}} are linearly 
dependent. This means that g P is a linear combination of 
vectors in {/£ : d G CUT t ^ p \{e(t, p)}}. And g P ° is a linear 
combination of vectors in {/? : d G U {e'}} because 

of g e G C(In(i) U {e'}). Therefore, g e — g P + g P is a linear 
combination of vectors in 

{/ d p : d G CUT tiP \{e(t,p)}} U {/f : d € Jn(i) U {e'}}. 

This is a contradiction to the choice of g e . 

In the following, we will show that fP and {/j? : d € 
CUT t>p \{e(t, p)}} are also linearly independent. 

• If <7e(e) 7^ 0, then, since and {f? : d G 
CUTt iP \{e(t, p)}} are linearly independent, = 
^(e)" 1 • g§ and {/£ : d e d7T tl/ ,\{e(t, p)}} are also 
linearly independent. 

• Otherwise p e (e) = 0. We claim that e ^ p. Assume the 
contrary, i.e., e £ p. Thus e(t, p) = e' which means 
f <ttP) = l e and f d (e) = for all d G CtfTyUeft p)}. 
Together with g e (e) = and dim(£ p (CUT ttP )) = w+S t , 
it follows that g P is a linear combination of vectors 
in {fP : d € C7[/T i . p \{e(t, / 9)}}. This imphes that 
g e € £ p (C77Ti, p \{e(i,p)}) + £P° (In(i) U {e'}), which 
leads to a contradiction. Hence, in view of e ^ p, one 
obtains g P = fP, which implies that fP and {/? : d G 
CUTf p \{e(t, p)}} are linearly independent. 

Finally, after all updates, we have CUT t , p C 7n(t) for each 
i € T and each p G i? t (<5 t ), and Rank((/f : e G CUT t , p )) = 
u> + <5t. As the matrix : e G CUT ttP ) is a submatrix of 
pP ± (fP : e € 7n(i)) with the same number of rows, it 
follows that Rank(Ff) = w + S t , i.e., n A(*,p) = {0}. 

For each error pattern 77 C E satisfying rankt(r]) < St, 
there exists an error pattern p G Rt(St) such that 77 ~< t p from 
Proposition Q] This implies that A(t, 77) C A(i, p), and thus, 

$(t) n A(i, 7?) c n A(t, p) = {o}. 

Now, we can say that d|^] n > 5 t + 1 for all /j G T, which, 
together with d^*J n < St + 1 from Theorem [2] shows that 

<£L = <5t + l for all t e T. 

The proof of 2): We just need to prove that if {J 7 ] > 
£ teT \Rt(S t )\, then 

£(7n(i)U{e'})\ 
U teT U peRt(5t ): [£P(CUT t , p \{e(t, p)}) + C pc (In(i) U {e'})] 
> 0. 

Let dim(£(In(i) U {e'})) = fc. For each t G T and 
p G Rt(8t), if e G i? t ,p, then e(t,p) G 7n(i) U {e'}, i.e., 



f e (t,p) e £(In(i) U {e'}). Moreover, we know /g^^ ^ 
£P(CUT ttP \{e(t,p)}), i.e., / e " (t p) ^ £/> (CUT t)P \{e{t, p)}), 
and / e " (t p) ^ (J»(i) U {e'». Together with p) G 

£P c (In(i) U {e'» and / e(i)P) = / e p (i p) + / L p (t p) , this implies 
that 

f eM i £P(CUT t , p \{e(t, p)}) + £ p \ln{i) U {e'}). 
Therefore, 

dim (£(In(i) U {e'})n 
[£P(CUT t , p \{e(t, p)}) + £P C (Jn(i) U {e'})]) < fc - 1. (5) 

Consequently, 

|£(/7i(i)U{e'})\U teT U pe « t (, t ): 

[£P(CUT ttP \{e(t,p)}) + £P C (In(i) U {e'})]| 
= \£(In(i) U {e'})| - |£(Jn(i) U {e'» n {U i£T U pe « t(5t) : 

[r'(CTTT tt ,\{e(t ) p)}) + £ pC (/n(7) U {e'})]}| (6) 

t6T p e-R t (<5«) 

where the last step follows from |.F| > Y^teT P° r trie 

inequality © > Q, it is readily seen from © that © > 0. 
It suffices to show |6]) > ((7|). It is not difficult to obtain that 
© = 0, i.e., 

|£(7n(i) U {e'})n 
{U teT U P ei tt( s t y.[£ p (CUT t ^\{e(t,p)})+£P c (In(i)U{e'})}}\ 

= EE t|P 

teT P £_R t (5 t ) 
if and only if \T\ = 1, |i2 t (<5 t )| = 1 and 

dim(£(7n(i) U {e'})n 
[£"(CI7T t)/ ,\{e(t, p)}) + £ pC (In(i) U {e'})]) = k - 1 

with e G £t,p, where Rt(St) — {p}- However, it is impossible 
that |i? t ((5 t )| = 1 because St < Ct- The proof is completed. 

■ 

According to the known results, for the existence of the 
network error correction MDS codes, the size of the required 
base field is at least J2teT ('?')■ ®y Theorem [5] we can say 
that LteT 1-^*0^)1 is enough. 

For any channel e € 75, if there exists a path from e to sink 
node t, then we call that e is connective with t. 

Lemma 6: Let E t be the set of channels which are connec- 
tive with sink node t G T. Then 

t£T t£T v 1 ' teT v 7 

Moreover, the necessary condition of the second inequality 
holding with equality is that there exists only one sink node 
in the network, i.e., \T\ = 1. 
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Proof: Both inequalities are clear, and we will only 
consider the necessary condition of the second inequality 
holding with equality. Suppose that there are more than one 
sink node, and let t and t' be two distinct sink nodes. 
Obviously, there exists a channel e with head(e) = t'. That 
is, e is not connective with sink node t. This implies that 
\E t \ < \E\, and thus ('f* 1 ) < (If), which shows that 

Eter Of' 1 ) < Eter ©• The lemma is proved. ■ 
From Theorem [5] and Lemma [6] we get the following 
corollary. 

Corollary 7: If IJ 7 ] > J2teT \s)> tnen there ex i st linear 
network error correction MDS codes, i.e., for all t G T, 



Example 1: Let G be a combination network [21, p. 450] 
Il20l p.26] with N = 6 and k = 4. That is, G is a single 
source multicast network, where there are N — 6 internal 
nodes, and one and only one channel from the source node 
s to each internal node. Arbitrary k = 4 internal nodes are 
connective with one and only one sink node, which implies 
that there are total (j) = 15 sink nodes. Thus, for G, we know 
that | J\ = 6, \T\ = (I) = 15, and \E\ = 6 + 4 x (£) = 66. 
It is evident that the minimum cut capacity Ct between s and 
any sink node t is 4. For example, Fig.[T]shows a combination 
network with N = 3, k = 2. Furthermore, let the information 




Fig. 1. Combination Network with N = 3, k = 2. 

rate be w = 2, and thus S t — 2 for each t G T. Therefore, 
1^(^)1 = |i? t (2)| = 4 x (*) = 24 for each t G T, and 
Eter \ R t( s t)\ = 15 x 24 = 360. Nevertheless, £ teT ('?*') = 
15 x («) = 420 and J2teT (?) = 15 X ( 6 2 6 ) = 32175. ' 

Now, we take into account the general network error cor- 
rection codes, and give the following theorem. 

Theorem 8: For any nonnegative integers (3 t with f3 t < St 
for each t G T, if IJ 7 ! > EteT l-^*(A)l> th en there exist linear 
network error correction codes satisfying for all t G T, 

d ( ± > fit + 1, 

where Rt(fit) is the set of error patterns p satisfying |p| = 
rankt(p) = /3t, that is, 

^t(A) = {error pattern p : \p\ = rank t {p) = f3 t }. 

The proof of this theorem is the same as that of Theorem 
|5]so long as replace St by fit, so the details are omitted. 



The following conclusion shows that the required field size 
for constructing general linear network error correction codes 
is smaller than that for constructing network MDS codes. 

Theorem 9: Let ft < St < [%-\, then \R t (Pt)\ < \Rt(S t )\- 

The proof of Theorem [9] is in Appendix [A] 

IV. The Constructive Algorithm of Linear 
Network Error Correction Codes 

From the discussions in the last section, we propose the 
following Algorithm [T] for constructing a linear network error 
correction code with required error correction capability. 



Algorithm 1 The algorithm for constructing a linear network 
error correction code with error correction capacity d^ in > /3 t 

for each teT. 

Input: The single source multicast network G = (V, E), the 
information rate w < mintgT Ct, and the nonnegative integers 
I3 t < S t for each t G T. 

Output: Extended global kernels (forming a linear network 

error correction code). 

Initialization: 

1) For each i e T and each p G Rtifit), find (w + /3 t ) 
channel disjoint paths Vt, P from In(s) or p' to t 
satisfying Corollary HI 

2) For each t G T and each p G i?t(/3 t ), initial- 
ize dynamic channel sets CUT t}P — In(s) U p' 
= {d[,d2, ■ ■ ■ , d' w } U {e' : e G p}, and the extended 
global encoding kernels f e = l e for all imaginary 
channels e G In(s) U E' . 

1: for each node i G V (according to the topological order 
of nodes) do 

2: for each channel e G Out(i) (according to an arbitrary 
order) do 

3: if e <£ U teT U P £R t (fl t ) E t , P then 

4: fe = le, 

5: all CUTt^p remain unchanged. 

6: else if e G U te T^U peflt(( g t) E t . p then 

7: choose g e G C{In{i) U {e'})\ U t£ T UpeH t o t ): 

[^(c7C/T t , p \{e(t,p)}) + £P c (In(i) U {T})], 
8: if g e (e) = then 

9: / e = .g e + l e , 

10: else 

11: fe = 9e(e)~ 1 -g e - 

12: end if 

13: For those CUT ttP satisfying e G -Et^, update 

C7C/T t , p = {C7C/r t , p \{e(t,p)}} U {e}; and for 
others, CUT t , p remain unchanged. 

14: end if 

15: end for 

16: end for 



Remark 2: Similar to the polynomial-time algorithm for 
constructing linear network codes in J3J, our algorithm is 
a greedy one, too. The verification of Algorithm [T] is from 
the proof of Theorems [5] and [8] In particular, if we choose 
fit — St for all t G T, then, by the proposed algorithm, we can 
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construct a linear network error correction code that meets the 
refined Singleton bound with equality. That is, we can obtain a 
linear network error correction MDS code. On the other hand, 
if we choose f3 t — for each t E T, then this algorithm 
degenerates into an algorithm for constructing linear network 
codes. 

Next, we will analyze the time complexity of the proposed 
algorithm. First, from 0, we can determine Rt(Pt) and find 
(w + f3 t ) channel disjoint paths satisfying Lemma [3] in time 
0(E teT ( l S)(w + l3t)\E\). 

Both methods presented by Jaggi et al. |5] are used to 
analyze the time complexity of the main loop. 

« If we use the method of Testing Linear Independent 
Quickly (5J III, A], the expected time complexity is at 
most 



O \E\ 



Y,\Rt{pt)\{v> + pt)b 



teT 



After a simple calculation, the expected time complexity 
of the algorithm using the method of Testing Linear 
Independent Quickly is at most 



0{\E\(w + /3 t ) 

■ E 



teT 



\E\ + l. 



teT 



If we use the method of Deterministic Implementation 1 5 
III,B], the time complexity of the main loop is at most 



e 3 \ 




Fig. 2. Network Gi. 



and 





_ rp(«t) 
- Vt.pi 


= (ei,ei,e 3 ),P tj(9l ' 


= (d\e 2 )}, 




= {d',e[, 


ei,e 2 ,e 3 }; 






_ rp(St) 
~ ^ r t, P i 


= (e' 2 ,e 2 ),pff 2 = 


(d',e 1 ,e 3 )}, 




= {d',e' 2) 


ei,e 2 ,e 3 }; 






_ rp(St) 
- Vt.pa 


= (e£,es),Pfi2 = 


(d',e 2 )}, 


E t,P3 


= {d',e' 3 , 


e2,e 3 }. 





Let the base field be Z 3 . Initialize the dynamic channel 
sets CUT ttPl = {d',e[}, CXJT t . P2 = {d',e' 2 }, CUT ttP3 = 
{d! , e 3 }, and 



fd' — I o J ' fe'i ~ I ) ' — ( 1 ) ' /' 



s 


1 ' 



which leads to dim(£« (CUT t , Pi )) = 2, (i = 1, 2, 3). 
For the channel e\ G Out(s), e\ G E t:Pi fl E ttP2 and 



(| E |,„, + !iti, 



EW(MI) 2 + £W(A)I(» + A) 



teT 



teT 



Therefore, the total time complexity of the algorithm 
using the method of Deterministic Implementation is at 
most 



0(\E\ 

\(£\R t {p t )\) 2 {i 



\E\ 



teT 



-E 

teT 



As an example, we will apply Algorithm Q] to construct a 
network MDS code for a very simple network G\ shown by 
Fig.H 

Example 2: For the network G\ shown by Fig. |2j let the 
topological order of all nodes be s -< i -< t, and the topological 
order of all channels be ei -< e 2 -< e 3 . It is obvious that C t = 
2. Let w — 1, and thus 8 t — C t ~ w = 1. Furthermore, we 
have R t {S t ) = R t (l) = {pi = {ei},p 2 = {e 2 },/Q 3 = {e 3 }}, 



e U)\[£ p KK}) + ^ (K, ei})] 

U[£"=({e 2 })+£^({d',e' 1 })] 



o/ \o 



So we choose g ei 



l)H(l 



because of 



~g ei G £({d', e'J)\[£« ({d'» + £^ ({<?, e'J)] 

U[£^({e 2 })+£^({d',e' 1 })]. 

And f ei = g ei , since g ei (ei) = 1. Then update CUT t:Pl = 
{d',ei}, CUT t)P2 — {ei,e 2 }, and CUT ttP3 remains un- 
changed. 

For the channel e 2 G Out(s), e 2 G E tiPl n E ttP2 fl 
and 

£(K,e 2 })\[£^({ e i})+£ pJ (K, e2 })] 
U [£"({ ei }) +£^(K,e' 2 })] U [£^({e 3 }) +£^(K,e 2 })] 
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We choose g &2 — 



- I o 





since 



3e 2 £ C({d', e 2 })\[C»> ({ ei }) + £rf (K, e 2 })] 
U[^({ei}) + ^({d',e^)]U[£^({ea)+^(K,e^})], 

which, together with g e2 (e 2 ) = 1, shows that / e2 = g e2 . 
Then, update CUT t Pl = {e 2 ,ei}, CUT t P2 = {ei,e2}, and 

For the channel e 3 e Out(i), e 3 £ E tyPl r\Et iPa r\E t , P3 and 

£({e 1 ,ea)\[^ 1 ({e 2 })+£^({e 1) ea)] 
U [#» ({e 2 }) + ^ ({ ei , e 3 })] U [C^ ({e 2 }) + C»* ({e x , e 3 })] 



U 



u 



We select g 63 = 



- I | 



satisfying 



g e3 e £({ ei , e 3 })\[£« ({e 2 }) + £^ ({e ls 4»] 
U[£"=({e 2 })+^ ({ei)e ^ })]u[£ P3 ({e2})+£ P§ ({ei)e ^ })] . 

It follows that / e3 = g e . 3 from g e3 (e3) = L an d update 

CUT t , pl = CUT t>P2 = CUT tiP3 = {e 2 , e 3 } C Jh(t). 



The decoding matrix at t is F t = (f E2 f E3 ) = ' 1 



1 o 
1 



It 



is easy to check that n A(t, pt) = {0} for i = 1,2,3. 
Further, let p = {ei,e 2 }. Then rank t (p) — 2 and $(f) n 
7^ {0}, which means d£] n = 2 = (5* + 1. That is, 
{/ei > /e 2 5 /e 3 } forms a global description of a linear network 
error correction MDS code for the network G\. 



V. Random Linear Network Error Correction 
Coding 

Random network coding was originally proposed in 0. 
When a node (maybe the source node s) receives the messages 
from its all incoming channels, for each outgoing channel, it 
selects the encoding coefficients uniformly at random over the 
base field F, uses them to encode the messages and transmits 
the encoded messages over the outgoing channel. In other 
words, the local encoding coefficients fe^.e are independently, 
uniformly distributed random variables on the base field T '. 
The performance analysis of random linear network coding is 
very important in theory and applications. In this section, we 
will investigate the error correction capability of random linear 
network coding. We first consider random linear network error 
correction MDS codes. Before the discussion, we give the 
following definitions. 

Definition 8: Let G be a single source multicast network, 
C be a random linear network error correction code on G, and 
^min ^ e me minimum distance at sink node t of C. 

. P ec (t) = Pr({dim(*(f)) < w} U {d^\ n <S t + 1}) is 
called the failure probability of random linear network 
error correction MDS coding for sink node t. 



< 



P ec = Pr({C is not regular} U {3 t G Tsuch that d\ 
St + 1}) is called the failure probability of random linear 



network error correction MDS coding for network G, 
that is the probability that network MDS codes are not 
constructed by the random method. 
In order to evaluate these two failure probabilities, the 
following lemma is useful. 

Lemma 10 ( fi8\ Lemma 1 ], K22\l): Let C be an n dimen- 
sional linear space over a finite field T, Cq, C\ be two 
subspaces of C of dimensions fco, fci, respectively, and 
(Cq U C\) = C. Let li, I2, • • • , l m (m = n — fco) be m 
independently and uniformly distributed random vectors taking 
values in C\. Then 



Pr(dim((£ U{Zi, h, 



lm})) 



Theorem 11: Let G be a single source multicast network, 
and w < vam t £T Ct- Using random method to construct a 
linear network error correction MDS code, then 

• for each t € T, the failure probability of random linear 
network error correction MDS coding for t satisfies 



Pec(t) < 1 - 1 



\MSt)\ 
l-Fl-l 



the failure probability of random linear network error 
correction MDS coding for the network G satisfies 



Pec < 1 



1 



EteT\MSt)\ 



where J is the set of the internal nodes in G. 

Proof: For the single source multicast network G = 
(V, E), s is the single source node, T is the set of the sink 
nodes, J = V — {s} — T is the set of the internal nodes, and 
E is the set of all channels. Let G — (V, E) be the extended 
network of G. 

For each sink node t G T and each error pattern p £ R t (St), 
Corollary [4] implies that there are (w + S t ) channel disjoint 
paths from either In(s) or p' to t satisfying the properties 
that (1) there exist exactly St channel disjoint paths from p' to 
t, and w channel disjoint paths from Jn(s) to t; (2) each of 
these St paths from p' to t starts with a channel e! £ p' and 
passes through the corresponding channel e £ p. Denote by 
V t ,p me set of (w + S t ) channel disjoint paths satisfying these 
properties and E t , p denotes the set of all channels in Vt, P - 

Note that the event "{dim($(t)) = w} n {ri ffi n = S t + 
1}" is equivalent to the event "{dim(<f>(<)) = w} n {V p £ 
Rt(St) : $(t) n A(i, p) = {0}}", and furthermore, the event 
"V p £ Rt(St) ■ Rank(F t p ) = w + St" implies the event 
"{dim($(i)) =w}n{V P £ R t (S t ) : fl A(i, p) = {0}}". 
Thus, we consider the following probability: 

Pr(n peRt(St) R^k(F t p ) =w + S t ). 

For the network G, let an ancestral order of nodes be 



s -< i x < i 2 -< • • • -< 



-< T. 



During our discussion, we use the concept of cuts of the 
paths similar to the dynamic set CUT ttP as mentioned above. 
The first cut is CUT t ^ pfi = In(s) U {e' : e £ p}, i.e., the 
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w imaginary message channels d[ , d' 2 , ■ ■ ■ ,d' w and imaginary 
error channels corresponding to the channels in p. At node 
s, the next CUT t . p .i is formed from C[/T t)P) o by replacing 
those channels in {In(s) U{e':e€ Out(s)}} H CUT t , pfi by 
their respective next channels in the paths. These new channels 
are in Out(s) n Et lP . Other channels remain the same as in 
CUTt^ P fi. At node i\, the next cut CUT tyP ^ is formed from 
CUT t:P: \ by replacing those channels in {In(i\) U {e' : e G 
Out(ii)}} n CUTt tPl i by their respective next channels in 
the paths. These new channels are in Out{i\) fl E t:P . Other 
channels remain the same as in CUTt lPt i. Subsequently, once 
CUTt. p .k is defined, CUTt tfly k+i is formed from CUTt tPt k 
by the same method. By induction, all cuts CUTt tPt k for 
t G T, p € R t (S t ), and k = 0, 1, 2, • • • , \J\ + 1 can be 
defined. Moreover, for each CUT t . p .k, we divide CUT t , Pt k 
into two disjoint parts CUT^ fe and CUT°^ k as follows: 

C^T*™ = {e : e G Cf/T t , M n In(i k )}, 
CUT t °Xk = {e = e G CUT tiP , k \CUT t %}. 

Define (to + <5 f ) x (w + <S t ) matrix P p(fe) = (/£ : e G 
CUT tiPtk ) for fc = 0, 1, ■ • • , | J| + l. If Rank(P p(fc) ) < iu + <5 t , 
we call that we have a failure at CUT t ^ p ^k- Let r^, t p ' represent 
the event "Rank(F p ^ fc ' ) ) = u> + (5 t ". Furthermore, let \ J\ = m, 
[ ' ^ is a submatrix of F p . It follows that 



and note that F( 

the event "Vp G P f (<5 t ), Rank(F p(m+1) ) = w + 8" implies 
the event "Vp G P t (<5 4 ), Rank(F p ) = to + <J t ". Therefore, 

l--Pec(i) 

=Pr({dim($(i)) =w}n {d { ± =5t + 1}) 

=Pr({dim(*(t)) = w } n {n peflt(St) $(t) n A(t, P ) = {0}}) 

>Pr(n pei? . t(5t) Rank(F p ) = u, + S t ) 

>Pr(n peRt{5t) T^\). 

Consequently, 



Pr(n 



p(t.p) 



i) 



>Pr(n peRt(St) T^\\ n peRt(St) T^) ■ 



■ n pefi t (<5 t ) r o*' P) ) 



Pr(n peJ?t(5t) rf ,P) I n p6flt(4t) r^iM"^^^ ,w ) 



= fl ^(n pe fl t(5t ) r i+i I n„ eBtW rf p) ), 



(8) 



fc=0 



where ([8]l follows from 

Pr(n peRtiSt) T^) 
=Pr(n pei?t(5t) Rank((/f : e G Jn(a) U p')) = w + 6 t ) 
=Pr(R&nk(I w+St ) = w + 6 t ) = l. 

For each channel e G E, let e G Out(i k ), Let <7 e be 
an independently and uniformly distributed random vector 
taking values in C(In(i k )), In other words, if In(i k ) = 
{di,d 2 , ■ ■ ■ ,di}, then 

where kd .. e (j = 1, 2, • • • , I) are independently and uniformly 
distributed random variables taking values in the base field T . 



It follows that gP = k du J^ + k d2 , e f^ ? + ■ ■ • + k dl , e f^ is 
also an independently and uniformly distributed random vector 
taking values in £ p (In(ik))- We always define f e — g e + l e . 
Therefore, for all e G E t , p C\Out{i k ) with e(t, p) G CC/T™ fc , 
i.e., e ^ p, it is shown that = because of e ^ p. Thus, /f 
is an independently and uniformly distributed random vector 
taking values in C p (In(ik))- Otherwise e G E t . p H Out(i k ) 
with e(t,p) G CUT° u p \, that is, e(t,/>) = e', then, f§ and 
{/j : c? G CUTt tPt k\e(t, p)} are always linearly independent, 
since fP(e) = 1 and / p (e) = for all d G CUT t , p ,k\e(t, p). 
Applying Lemma [TUl we derive 

\CUTl^ k \ . 
p r(r (M |r (M )= JJ ^_ 1 



w+S t 



> 



nfi-i4)>i-E^ 



> 



_ V— l 



Consequently, for each fc (0 < k < m), one has 

Pr(n peflt(5t) r^ 1 ) |n peflt(5t) r^ ) ) 



=i - Pr(u peRt(St) T^\n peRt(St) ri M) ) 
>i- E p r(r^f\n peRt{St) r^) 

pefit(<5 t ) 

=i- E ^(r&fir^) 

pefl*(5*) 



= 1 



p€Rt(S t 

\Rt(6t)\ 



Combining the above inequalities, we have 

m 

i - Pc(t) > n ^(n^^jriSl n P^ t (^) r i t,p) ) 



> 1 



That is, 



Next, 



\Rt(S t )\ 



p ec (t) < i - i 



m+l 



1^(^)1 



m+l 



1 - P ec > Pr(n t eT n peRt(St) Rank(F t p ) = w + S t ) 
>Pr(n t eT ^ P eRt(s t ) T m+n n teT ^ P eR t (s t ) ^m P \ • • • , 

Hter n peflt ( 5t) r[ ) t ' p) ) 
>Pr(n teT n peRt(5t) r^til n teT n peRt(St) T^) 
■ Pr(n teT n peRt{St) r^ p )| n teT npg^^jr^) • • • 
• Pr(n teT n peKt(5t) rf' p) | n teT n peflt(5t) rf p) ) (9) 

= II ^(n t£ T n peiJi(5i) r(*; p) | n teT n peflt(5t) r^ p) ), 



fc=0 
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where © follows from Pr(n teT (^ P eR t (s t ) T^ p) ) = 1. 
Furthermore, for each k (0 < k < m), 

Pr(n teT n peRt{St) r£&>| n teT n p£Rt(St) r^ p) ) 
=i - Pr(u teT u peRt(St) r^ )C | n teT n pei?t( , t) r^ p) ) 

>i-E E ^i+rir^) 
=i-E E [i-^ri^ir^)] 

teT P eR t (s t ) 

>'-E £ 

t£T P ER t (5 t ) 1 1 
E* 6 tIW| 



1 l-F] - 1 ■ 

Combining the inequalities (O and (II Oi l, we have 



(10) 



1 - P Fr > 1 - 



that is, 



P RR < 1 - 1 - 



171-1 



(Tl+l 



(Tl+1 



The proof is completed. ■ 
Applying Lemma [6] to Theorem [TT] we derive the following 
corollary. 

Corollary 12: The failure probability P ec (t) of random 
linear network error correction MDS coding for each t E T 
satisfies 



(\E t \\ 



\J\ + 1 



P ec (t) < 1- 1 - 



<i- i- 



The failure probability P ec of random linear network error 
correction MDS coding for the network G satisfies 



Pec <1 - 1 



<1- 1- 



Jl + l 



However, in practice, we sometimes need general linear 
network error correction codes instead of the network MDS 
codes. That is, we only need the codes satisfying that its 
minimum distance d^ in > ft, where ft < St is a nonnegative 
integer. The part of reason is that usually the field size required 
by general linear network error correction codes is smaller than 
that of network MDS codes. Hence, we should also discuss the 
random method for the general linear network error correction 
codes. Similarly, we define the failure probabilities for random 
linear network error correction codes as follows. 

Definition 9: Let G be a single source multicast network, 
C be a random linear network error correction code on G, and 



At) 



be the minimum distance at sink node t. Define that 



. P ec (t,j3 t ) 4 p r ({dim(*(t)) < w} U {d£L < ft + 1}), 
that is the probability that the code C cannot either 



be decoded or satisfy that the error correction capacity 
d min > A + 1 at the sink node t; 
. P ec (/3 t ) = Pr({ C is not regular }U{3 t e T such that 
d min < Pt + that is the probability that the regular 
linear network error correction codes with g^*- > ft + 1 
cannot be constructed by the random method. 
Using the similar method to prove Theorem [TT] and com- 
bining it with the method to prove the random linear network 
coding with proper redundancy Q Theorem 2], we can get 
the following results. 

Theorem 13: Let G be a single source multicast network, 
the minimum cut capacity for sink node t E T be C t and 
the information rate be w symbols per unit time satisfying 
to < min te T Ct- Using random method to construct a linear 
network error correction code, then 
• for each t E T and ft < St, 

\Rt(M\( s 

Pec(t,(3 t )< 



for the network G, 



PedPt) < E 



6 t -j3t+\J\+l^ 



t£T 



Remark 3: Both Theorems QT| and [T3] above imply that 
these failure probabilities can become arbitrarily small when 
the size of the base field F is sufficiently large. 

Balli, Yan, and Zhang Q used D^ in to denote the minimum 
distance of random linear network error correction code at a 
sink node t E T. Obviously, the refined Singleton bound tells 



us that takes values in {0, 1, 2, ■ 



they studied the probability mass function of Di'.. For a code 



-1}. Furthermore, 

(*) 



with the minimum distance d[ 



W at cir>V nnAo + X. _1_ 1 _ 

min 

is called the degradation of the code at t. Then they presented 
the following conclusions. 

Proposition 3 ( [7. Theorem 4]): For single source multi- 
cast over an acyclic network G, let the minimum cut capacity 
for sink node t E T be Ct, let the information rate be w 
symbols per unit time, let St = Ct — w be the redundancy of 
the code for the sink node t E T. For a given d > 0, the linear 
random network code satisfies: 



Pr&ZL <S t + l-d)< 



( \E\ \(d+\J\ + l\ 
\5t-d)\ |J| ) 



(|^| -l)*fl • 

Furthermore, the probability that random linear network code 
has minimum distance at least St + 1 — d at all sinks t E T is 
lower bounded by, 

Ut-d) ( 



Pr(D^\ n >S t + l-d,VtET) > 1 



E 



m - D d+i 



This proposition can lead to an upper bound on the field size 
required for the existence of linear network error correction 
codes with degradation at most d. 

Proposition 4 ( [7, Corollary 1 ]): If the field size satisfies 
the following condition: 



\P\ >2 + 
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then there exists a code having degradation at most d at all 
sinks t € T. 

In the same way, applying Theorem Q~3] we can also get a 
probability mass function of D^l . 

Corollary 14: For a single source multicast network G — 
(V, E), let the minimum cut capacity for sink node t E T be 
Ct, the information rate be w symbols per unit time satisfying 
io < min t£ T Ct, and S t — Ct — w be the redundancy of the 
code for sink t G T. For a given d > 0, the random linear 
network error correction codes satisfy: 



and 



Pr(D± <St + l-d)< 



Pr(DV>8t + l-d,VteT) 



>i-E 



\R t (S t 



d)\r\T) 



teT 



This corollary also leads to an upper bound on the field 
size required for the existence of linear network error cor- 
rection codes with degradation at most d. On the other 
hand, Theorem [8] shows that the required field size satisfies 
1-^1 — StgT \R(M\- Therefore, we derive the following 
result. 

Corollary 15: If the size of the base field T satisfies the 
following condition: 



\T\ > min 



{J2\Rt(St-d)\, 



\R(S t -d)\ 



\J\ 

\J\ 



then there exists a regular linear network error correction code 
having degradation at most d at all sink nodes t E T. 
When d = 0, it is readily seen that 



d)\ 



< 



2+(|J| + l)£|i?t(<5 t )l 



teT 



=2 



.teT 



\Rt{S t -d)\ 



\J\ 

\J\ 



This means that, for network MDS codes, Corollary [15] 
cannot give a smaller field size required. But, for d > 1, 



the size bounds 2 



(d+\J\ + l\ 



and 



J2t eT \M6t-d)\(° |; 

X^teT \Rt($t ~ d)\ have no deterministic relations. We will 
illustrate this point through the following example. 

Example 3: For network G2 shown by Fig. [3] below, let 
w = 2. Then S t = Ct — W = 2. 




v© 



Fig. 3. Network G 2 with |T| = 1, | J\ = 1, C t = 4. 

• In the case d = 0, it is clear that 

'd+\J\ + 1 



2+ J2\ R t(St-d)\ 
.teT 

=2 + 2|flt(2)| > \Rt(2)\ 



\J\ 



In the case d = 1, a simple calculation gives 
£1^-^1 = 1^(1)1 = 8, 



teT 



and 



2+ £|i? t (<5 t -rf)| 
.teT 

=2 + %/24 < 2 + 5 = 7 
This shows that in this case 



.teT 



d+\J\ + 1 



d+\J\ + 1 
Ul 



<X)| J R t (6t-d)l- 



teT 



Nevertheless, for the network G3 shown by Fig. H] let w 
which shows 5t = Ct — w = 2. 




Fig. 4. Network G 3 with |T| = 1, | J\ = 8, C t = 4. 

• In the case d — 0, obviously, 

'd+|J| + 1 



2+ ^l^-d)! 
.teT 

=2 + 9|i? t (2)| > |i? t (2)| 



In the case d = 1, after a simple calculation, we de< 
that 

J2\Rt(St-d)\ = \R t (l)\ = l2, 
teT 
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and 



\Rt(i) 



d+\J\ + 1 



\J\ 



Therefore, 



^2\Rt(S t -d)\ 



.teT 



2+ (12 x 45)5 > 20. 



d+\J\ + l 



>J2\RSt-d)\. 



teT 



VI. Conclusions 

In this paper, using the extended global encoding kernels 
proposed by Zhang in lfT2l . we can prove the refined Singleton 
bound in network error correction coding more easily, and 
give a constructive proof to show that this bound is tight, that 
is, we construct network MDS codes which meet this bound 
with equality. As a consequence of this proof, an algorithm 
is designed to construct linear network error correction codes, 
especially network MDS codes. The time complexity of the 
proposed algorithm is analyzed. It is shown that the required 
field size for the existence of linear network error correction 
codes can become smaller than the previously known results, 
and even much smaller in some cases. 

For random linear network error correction coding, the up- 
per bounds on the failure probabilities for network MDS codes 
and general linear network error correction codes are obtained. 
And we slightly improve on the probability mass function 
of the minimum distance of the random linear network error 
correction codes introduced in |7|, as well as the upper bound 
on the field size required for the existence of linear network 
error correction codes with degradation at most d. 

Appendix A 
Proof of Theorem[9] 

Proof: We choose an error pattern p\ E Rt{Pt) arbi- 
trarily, that is, the chosen error pattern p 4 satisfies \p\\ = 
rankt(pi) = fit- Then we can extend p\ to an error pattern p[ 
with pi C p[ and \p[\ — rank t (p' 1 ) = Ct, since the minimum 
cut capacity between s and t is Ct- Define two sets as follows: 



and 



toi,p t = {error pattern p C p[ : p e -Rt(ft)} 



Qi,5 t — {error pattern p' C p[ : p' E R t (S t )}- 



From the above definitions, we have 



l«i A I 



ft 



and |fii St \ 



Note that (3 t < S t < |_t|\| implies 



< 



In other 



(ft) ^ U t ; 

words, for each p G fii,s t , there exists an error pattern p' E 
f2i 5 t such that p is covered by p', i.e., p C p', and 6' ^ rf 
for any distinct 9,ij E &i,p t - 



Again, choose an error pattern p 2 E -R t (ft)\f2i « t arbitrar- 
ily. In the same way as for pi, p 2 can be extended to an error 
pattern p' 2 with p 2 C p' 2 and \p' 2 \ — rank t (p' 2 ) = C t - Define 
the next two sets: 

^2, ft = {error pattern p C p' 2 : p E i? t (ft), P £ pi n p' 2 }, 

and 

Sh,5 t = {error pattern p C p' 2 : p g R t (S t ),p pi n p' 2 }. 

Obviously, for all p E ^2, ft and p 1 E 0, 2t s t , we have p ^ 
f2i (J g f and p 1 ^ ^i.s t - This means that f2i,ft H &2,p t — and 
tti,s t n fi 2) i t = 0. Let \p{ flp 2 | = fci >2 . Then 



2,Pt 



ki,2 
St 



We adopt the convention that (?) = for a < b. 

Similarly, we choose an error pattern p% E Rt(f3t)\£li,p t U 
f2 2 ,ft, an d extend P3 to an error pattern p' 3 with p 3 C p' 3 and 
\p' 3 \ = rank t (p' 3 ) — Ct- Define 



«3A = {p C p' 3 : p G Rt(fit),p£ {p'i U p 2 } n p 3 }, 



and 

fis,* = {p' C p 3 : p' G flt(5t), p' £ {pi U p 2 } n p' 3 }. 

We claim that for all p E fia, p t and p' G ^3.5 t , p Q\,p t U 
fi 2 ,ft and p' ^ fii,5 f U &2,s t - Conversely, suppose that p E 
Uf =1 Qj,ft ( res P- P' e u !=iA,5 t )- Together with p G fi 3 , ft 
(resp. p' G ^3,a t ), this shows that p C {pj U p 2 } n p' 3 (resp. 
p' C {p^ U p' 2 } n p 3 ). It contradicts to our choice p E ^3,p t . 
Thus, 3)A n n iA = and fi 3 ,,5 t n Cl iA = 0, t = 1, 2. 
Further, let |{pi U p 2 } n p 3 | = £1,2,3. Then 



in 



3,ft 



fcl. 
ft 



2.3 



and If2 



3,6 t \ 



h,2,3 

St 



Choose an error pattern P4 G i?t(/?t)\uf =1 ili, ( g t , and extend 
P4 to an error pattern p 4 with p 4 C p' 4 and |p 4 | = rank t (p' 4 ) = 
Ct- Define two sets similarly: 



^4,ft = {p c P4 : p g Rt(Pt),p£ {uLip'J n pU, 



and 

n*,*, = y C p 4 : p' G i? t (<5 t ), p' ^ {uf =1 pU n p!>. 

For all p G f2 4 ^ t and p' G ^4,<5 t , P ^ Uj-ifli,^ and p' ^ 
U^jfij^j. Assume the contrary, i.e., p G U? =1 Oj ); g t , which 
implies that p C {p[ LI p' 2 U p' 3 } n p 4 . It is a contradiction. 
Similarly, we have p' ^ Uj—jfij ^ for all p' G ^4,5 t - That 
is, £li,p t n Oj ); g t = and ^4^, n Qi,8 t =0, i = 1, 2, 3. Let 
|{U? =1 p-} n p 4 | = fei l2 .3.4. It follows that 



4,ft 



2,3,4 

^1,2,3,4 

ft 



CA _ Al,2,3,4 



We continue this procedure until we cannot choose a 
new error pattern p G Rt{Pt)- Since \R t (j3t)\ is finite, this 
procedure will stop at some step. Without loss of generality, 
assume that the procedure stops at the mth step. That is, 
Rt(Pt) = U^fi^. Together with what we have proved 
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above, £li t p t H %,/3 t = for all i,j satisfying i ^ j 
(1 < i>j < m )- This implies that 



|i? t (/3 t )|=El^l=E (fl 
i=i »=i LVPt/ 



/St 



where set k\ = 0. Similarly, we also have Sli St fl fij,i t = 
for all i,j satisfying i ^ j (1 < i,j < rri), and U^ =1 Oi )< j ( C 
Rt(St), which implies that 



iwi>£ifi iA i=E 



;.=i 



i=l 



In order to prove |i? t (/3 t )| < \R t (S t )\, it suffices to show 
< |n, A |, i-e., (g) - < (J) - for 

each i = 1, 2, • • ■ , m. 

To simplify the notation, we omit the subscripts in the 
following discussion. It follows that we just need to prove 



that is, 



(id 



where /3 < 5 < [f J and fc < C. 

If fc < <5, the inequality dTTb immediately holds. Otherwise 
k > S, note that 



and 



s)-(°, 

c\ ( c 
s) \S-1 
C \ ( c 
+ 2) U + l 



k\ (k 

k\ ( k 
s) [s-i 

k \ f k 
(3 + 2)~{l3 + l 



C \_f c 
5 - l) \S - 2 

C \ (C 
/3 + lJ U 



5 - l) ~ {§ - 2 

k 

P + l 



+ ■ 



This implies that the inequality ( ITTb holds provided that we 
can show 



C 
a + 1 



C 



> 



a + 1 



for any a satisfying /3 < a < S— 1. After a simple calculation, 
it is equivalent to prove 
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C(C - 1) • • • (C - a + 1)(C - 2a - 1) 
>fc(fc-l)---(fc-a + l)(fc-2a-l). (12) 

It is not difficult to see that the inequality (fl2l) holds for k > S. 
This completes the proof. ■ 



